Searching and ranking in entity-relationship graphs
نویسنده
چکیده
The Web bears the potential to become the world’s most comprehensive knowledge base. Organizing information from the Web into entity-relationship graph structures could be a first step towards unleashing this potential. In a second step, the inherent semantics of such structures would have to be exploited by expressive search techniques that go beyond today’s keyword search paradigm. In this realm, as a first contribution of this thesis, we present NAGA (Not Another Google Answer), a new semantic search engine. NAGA provides an expressive, graph-based query language that enables queries with entities and relationships. The results are retrieved based on subgraph matching techniques and ranked by means of a statistical ranking model. As a second contribution, we present STAR (Steiner Tree Approximation in Relationship Graphs), an efficient technique for finding “close” relations (i.e., compact connections) between k(≥ 2) entities of interest in large entity-relationship graphs. Our third contribution is MING (Mining Informative Graphs). MING is an efficient method for retrieving “informative” subgraphs for k(≥ 2) entities of interest from an entity-relationship graph. Intuitively, these would be subgraphs that can explain the relations between the k entities of interest. The knowledge discovery tasks supported by MING have a stronger semantic flavor than the ones supported by STAR. STAR and MING are integrated into the query answering component of the NAGA engine. NAGA itself is a fully implemented prototype system and is part of the YAGONAGA project.
منابع مشابه
Learning Parameters in Entity Relationship Graphs from Ranking Preferences
Semi-structured entity-relation (ER) data graphs have diverse node and edge types representing entities (paper, person, company) and relations (wrote, works for). In addition, nodes contain text snippets. Extending from vector-space information retrieval, we wish to automatically learn ranking function for searching such typed graphs. User input is in the form of a partial preference order betw...
متن کاملScalable Link-based Personalization for Ranking in Entity-Relationship Graphs
Authority flow techniques like PageRank and ObjectRank can provide personalized ranking of typed entity-relationship graphs. There are two main ways to personalize authority flow ranking: Nodebased personalization, where authority originates from a set of userspecific nodes; Edge-based personalization, where the importance of different edge types is user-specific. We propose for the first time ...
متن کاملSearching and Ranking in Entity-Relation Graphs
Ranking results to queries over information spaces where the data is represented in a directed graph has been the focus of much recent research work. In this report, we study three ranking paradigms addressing di erent issues in this regard. Entity-Relation(ER) graphs have entities as nodes, with relations between them being represented as edges. The PageRank system for ranking relies on the li...
متن کاملLearning to rank related entities in Web search
Entity ranking is a recent paradigm that refers to retrieving and ranking related objects and entities from different structured sources in various scenarios. Entities typically have associated categories and relationships with other entities. In this work, we present an extensive analysis of Web-scale entity ranking, based on machine learned ranking models using an ensemble of pair-wise prefer...
متن کاملEntity Ranking on Graphs: Studies on Expert Finding
Todays web search engines try to offer services for finding various information in addition to simple web pages, like showing locations or answering simple fact queries. Understanding the association of named entities and documents is one of the key steps towards such semantic search tasks. This paper addresses the ranking of entities and models it in a graph-based relevance propagation framewo...
متن کامل